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•  Motivations 

-  Uncertainty  in  ontology  representation,  reasoning  and  mapping 

-  Why  Bayesian  networks  (BN) 

•  Overview  of  the  approaeh 

•  Translating  OWL  ontology  to  BN 

-  Representing  probabilistic  information  in  ontology 

-  Structural  translation 

-  Constructing  conditional  probability  tables  (CPT) 

•  Ontology  mapping 

-  Formalizing  the  notion  of  “mapping” 

-  Mapping  reduction 

-  Mapping  as  evidential  reasoning 

•  Conclusions 
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•  Uncertainty  in  ontology  engineering 

-  In  representing/modeling  the  domain 

•  Besides  A  subclasOfB,  also^  is  a  small  subset  ofB 

•  Besides  A  hasProperty  P,  also  most  objects  with  P  are  in  A 

•  A  and  B  overlap,  but  none  is  a  subclass  of  the  other 

-  In  reasoning 

•  How  close  a  description  D  is  to  its  most  specific  subsumer 
and  most  general  subsumee? 

•  Noisy  data:  leads  to  over  generalization  in  subsumptions 

•  Uncertain  input:  the  object  is  very  likely  an  instance  of 
class  A 
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-  In  mapping  concepts  from  one  ontology  to  another 

•  Similarity  between  concepts  in  two  ontologies  often  cannot 
be  adequately  represented  by  logical  relations 

-  Overlap  rather  than  inelusion 

•  Mappings  are  hardly  1  -to- 1 

-  If  ^  in  ontol  is  similar  to  B  in  onto2,  A  would  also  be  similar  to 
the  sub  and  super  elasses  of  B  (with  different  degree  of 
similarity) 

•  Uncertainty  becomes  more  prevalent  in  web  environment 

-  One  ontology  may  import  other  ontologies 

-  Competing  ontologies  for  the  same  or  overlapped  domain 
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•  Why  Bayesian  networks  (BN) 

-  Existing  approaches 

•  Logic  based  approaches  are  inadequate 

•  Others  often  based  on  heuristic  rules 

•  Uncertainty  is  resolved  during  mapping,  and  not 
considered  in  subsequent  reasoning 

-  Loss  of  information 

-  BN  is  a  graphic  model  of  dependeneies  among  variables: 

•  Structural  similarity  with  OWL  graph 

•  BN  semantics  is  compatible  with  that  of  OWL 

•  Rich  set  of  efficient  algorithms  for  reasoning  and  learning 

UMBC 


an  Honors  University  in  Maryiand 


5 


•  Directed  acyclic  graph  (DAG) 

-  Nodes:  (discrete)  random  variables 

-  Arcs:  causal/influential  relations 

-  A  variable  is  independent  of  all  other  non-descendent 
variables,  given  its  parents 

•  Conditional  prob.  tables  (CPT) 

-  To  each  node:  |  .)  where  ^ ^  is  the  parent  set  of  x^ 

•  Chain  rule; 


-  Joint  probability  as  product  of  CPT 
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OWL-BN 
translation 

concept 
_  mapping 

OWL-BN  translation 

•  By  a  set  of  translation  rules  and 
proeedures 

•  Maintain  OWL  semanties 

•  Ontology  reasoning  by  probabilistie 
inferenee  in  BN 


Ontology  mapping 

•  A  parsimonious  set  of  links 

•  Capture  similarity  between  eoneepts 
by  joint  distribution 

•  Mapping  as  evidential  reasoning 
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•  Encoding  probabilities  in  OWL  ontologies 

-  Not  supported  by  current  OWL 

-  Define  new  classes  for  prior  and  conditional  probabilities 

•  Structural  translation;  a  set  of  rules 

-  Class  hierarchy:  set  theoretic  approach 

-  Logical  relations  (equivalence,  disjoint,  union,  intersection...) 

-  Properties 

•  Constructing  CPT  for  each  node: 

-  Iterative  Proportional  Fitting  Procedure  (IPFP) 

•  Translated  BN  will  preserve 

-  Semantics  of  the  original  ontology 

-  Encoded  probability  distributions  among  relevant  variables 
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Encoding  Probabilities 


•  Allow  user  to  specify  prior  and  conditional  Probabilities.  ^ 

-  Two  new  OWL  classes:  "PriorProbObj"  and  "CondProbObj" 

-  A  probability  is  defined  as  an  instance  of  one  of  these  classes.  ■ 

•  P(A):  e.g.,  P(Animal)  =  0.5 

<prob:PriorProbObj  rdf:ID=’'P(Animal)’'>  ® 

<prob :  has  V  ariablexrdf :  value>&ont ;  Animal</ rdf :  valuex/ prob :  has  V  ariable> 
<prob :  hasProb  V  alue>0 . 5  </ prob :  hasProb  V  alue> 

</prob:PriorProbObj> _ 


•  P(A|B):  e.g.,  P(Male|Animal)  =  0.48 

<prob:CondProbObjT  rdf:ID=’T(Male|Animal)"> 

<prob :  hasConditionXrdf :  value>&ont ;  Animal</ rdf:  valuex/ prob :  hasCondition> 
<prob :  has  V  ariableXrdf :  value>&ont ;  Male</ rdf :  valuex/ prob :  has  V  ariable> 
<prob :  hasProb  V  alue>0 . 5  </ prob :  hasProb  V  alue> 

</prob:CondProbObjT> _ 
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structural  Translation 


•  Set  theoretic  approach 

-  Each  OWL  class  is  considered  a  set  of  objects/instances 

-  Each  class  is  defined  as  a  node  in  BN 

-  An  arc  in  BN  goes  from  a  superset  to  a  subset 

-  Consistent  with  OWL  semantics 

<owl:Class  rdf:ID=“Human"> 

<rdfs:subclassOf  rdf:resource=’'#Animar'>  I 

<rdfs:subclassOf  rdf:resource=’'#Biped'’> 

</owl:Class> 


RDF  Triples: 

(Human  rdfitype  owbClass) 

(Human  rdfsisubClassOf  Animal) 
(Human  rdfsisubClassOf  Biped) 

Translated  to 
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Structural  Translation 


Logical  relations 

-  Some  can  be  encoded  by  CPT  (e.g..  Man  =  Human  H  Male) 


Human 

True  50.0 
False  50.0 

Male 

True  50.0 
False  50.0 

Man 

True  50.0 
False  50.0 

Human 

Male 

True 

False 

True 

True 

100.00 

0.000 

True 

False 

0.000 

100.00 

False 

True 

0.000 

100.00 

False 

False 

0.000 

100.00 

-  Others  can  be  realized  by 
adding  control  nodes 

Man  Human 
Woman  c  Human 
Human  =  Man  u  Woman 
Man  n  Woman  =  0 

auxiliary  node:  Human_l 
Control  nodes:  Disjoint,  Equivalent 
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Constructing  CPT 


•  Imported  Probability  information  is  not  in  the  form  of  CPT 

•  Assign  initial  CPT  to  the  translated  structure  by  some 
default  rules 

•  Iteratively  modify  CPT  to  fit  imported  probabilities  while 
setting  control  nodes  to  true. 

-  IPFP  (Iterative  Proportional  Fitting  Procedure) 

To  find  Q{x)  that  fit  QiEj),  . . .  Q(Ei)  to  the  given  P(x) 

•  Qo(^)  =  -PW;  then  repeat  Q^(x)  =  Q{Ej)/  Qj.jiEj)  until 

converging 

*  Qco  (-^)  is  an  I-projection  of  P  (x)  on  Q(Ej),  . . .  Q(Ej) 
(minimizing  Kullback-Leibler  distance  to  P) 

-  Modified  IPFP  for  BN 
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Formalize  the  notion  of  mapping 

Mapping  involving  multiple  concepts 

Reasoning  under  ontology  mapping 

Assumption:  ontologies  have  been  translated  to 
BN 
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Formalize  The  Notion  of  Mapping 


•  Simplest  case:  Map  concept  in  Onto^  to  in  Onto^ 

-  How  similar  between  E’  and 

-  How  to  impose  belief  (distribution)  of  E^  to  Onto^ 

•  Cannot  do  it  by  simple  Bayesian  conditioning 

P(x|  E^)  =  ^  PiM  E^)P{E^  I  E^)  similarity(£'^,  E^) 

-  Onto'  and  Onto^  have  different  probability  space  (Q  and  P) 

.  g(£/)  ^  p(£/) 

•  New  distribution,  given  in  Onto':  P*{x)  ^  'E  P {x\E^)P{E^) 

-  similarity(£'^,£'^)  also  needs  to  be  formalized 
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Formalize  The  Notion  of  Mapping 


•  Jeffrey’s  rule 

-  Conditioning  cross  prob.  spaces 

-  P*is  an  I-projection  of  P(x)  on  Q{E^)  (minimizing  Kullback- 
Leibler  distance  to  P) 

-  Update  P  to  P*  by  applying  Q{E^)  as  soft  evidence  in  BN 

•  similarity  (£'^,  E^) 

-  Represented  as  joint  prob.  R(E\E^)  in  another  space  R 

-  Can  be  obtained  by  learning  or  from  user 

•  Define 
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Reasoning  With 


Using  similarity E^): 
R*{E^) 

=  R*{e,  E^)IR*{E‘) 
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Reasoning  With  Multiple  niap(£'^,£'^) 


Mapping  Reduction 


•  Multiple  mappings 

-  One  node  in  BN  1  can  map  to  all  nodes  in  BN2 

-  Most  mappings  with  little  similarity 

-  Which  of  them  can  be  removed  without  affecting  the  overall 

•  Similarity  measure: 

-  Jaccard-coefficient:  sim(E^,E^)  =  P(E^  n  E^)IR(E^  vjE^) 

-  A  generalization  of  subsumption 

-  Remove  those  mappings  with  very  small  sim  value 

•  Question:  can  we  further  remove  other  mappings 

-  Utilizing  knowledge  in  BN 
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Conclusions 


Summary 

-  A  principled  approach  to  uncertainty  in  ontology 
representation,  reasoning  and  mapping 

Current  focuses: 

-  OWL-BN  translation:  properties 

-  Ontology  mapping:  mapping  reduction 

Prototyping  and  experiments 
Issues 

-  Complexity 

-  How  to  get  these  probabilities 
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